智能论文笔记

Semi-Supervised Domain Adaptation by Similarity based Pseudo-label Injection

Abhay Rawat , Isha Dua , Saurav Gupta , Rahul Tallamraju

分类：计算机视觉

2022-09-05

半监督域适应性（SSDA）中的主要挑战之一是标记源和目标样本数量之间的偏差比，导致该模型偏向源域。 SSDA中的最新作品表明，仅将标记的目标样品与源样本对齐可能导致目标域与源域的不完全域对齐。在我们的方法中，为了使两个域对齐，我们利用对比的损失，使用来自两个域的监督样本学习语义上有意义的域不可知特征空间。为了减轻偏斜标签比率引起的挑战，我们通过将其特征表示形式与来自源和目标域的标记样品的特征表示形式进行比较，为未标记的目标样本进行了伪造。此外，为了增加目标域的支持，在训练过程中，这些潜在的嘈杂的伪标签逐渐被逐渐注入标记的目标数据集中。具体而言，我们使用温度缩放的余弦相似性度量将软伪标签分配给未标记的目标样品。此外，我们计算每个未标记样品的软伪标签的指数移动平均值。这些伪标签逐渐注入或删除）（从）基于置信阈值（以补充源和目标分布的比对）（从）中（从）中。最后，我们在标记和伪标记的数据集上使用有监督的对比损失来对齐源和目标分布。使用我们提出的方法，我们在SSDA基准测试中展示了最先进的性能-Office-Home，Domainnet和Office-31。

translated by 谷歌翻译

Robust Learning Protocol for Federated Tumor Segmentation Challenge

Ambrish Rawat , Giulio Zizzo , Swanand Kadhe , Jonathan P. Epperlein , Stefano Braghin

分类：机器学习 | 计算机视觉

2022-12-16

In this work, we devise robust and efficient learning protocols for orchestrating a Federated Learning (FL) process for the Federated Tumor Segmentation Challenge (FeTS 2022). Enabling FL for FeTS setup is challenging mainly due to data heterogeneity among collaborators and communication cost of training. To tackle these challenges, we propose Robust Learning Protocol (RoLePRO) which is a combination of server-side adaptive optimisation (e.g., server-side Adam) and judicious parameter (weights) aggregation schemes (e.g., adaptive weighted aggregation). RoLePRO takes a two-phase approach, where the first phase consists of vanilla Federated Averaging, while the second phase consists of a judicious aggregation scheme that uses a sophisticated reweighting, all in the presence of an adaptive optimisation algorithm at the server. We draw insights from extensive experimentation to tune learning rates for the two phases.

translated by 谷歌翻译

Design of an All-Purpose Terrace Farming Robot

Vibhakar Mohta , Adarsh Patnaik , Shivam Kumar Panda , Siva Vignesh Krishnan , Abhinav Gupta , Abhay Shukla , Gauri Wadhwa , Shrey Verma , Aditya Bandopadhyay

分类：机器人

2022-12-04

Automation in farming processes is a growing field of research in both academia and industries. A considerable amount of work has been put into this field to develop systems robust enough for farming. Terrace farming, in particular, provides a varying set of challenges, including robust stair climbing methods and stable navigation in unstructured terrains. We propose the design of a novel autonomous terrace farming robot, Aarohi, that can effectively climb steep terraces of considerable heights and execute several farming operations. The design optimisation strategy for the overall mechanical structure is elucidated. Further, the embedded and software architecture along with fail-safe strategies are presented for a working prototype. Algorithms for autonomous traversal over the terrace steps using the scissor lift mechanism and performing various farming operations have also been discussed. The adaptability of the design to specific operational requirements and modular farm tools allow Aarohi to be customised for a wide variety of use cases.

translated by 谷歌翻译

Review of Methods for Handling Class-Imbalanced in Classification Problems

Satyendra Singh Rawat , Amit Kumar Mishra

分类：机器学习

2022-11-10

Learning classifiers using skewed or imbalanced datasets can occasionally lead to classification issues; this is a serious issue. In some cases, one class contains the majority of examples while the other, which is frequently the more important class, is nevertheless represented by a smaller proportion of examples. Using this kind of data could make many carefully designed machine-learning systems ineffective. High training fidelity was a term used to describe biases vs. all other instances of the class. The best approach to all possible remedies to this issue is typically to gain from the minority class. The article examines the most widely used methods for addressing the problem of learning with a class imbalance, including data-level, algorithm-level, hybrid, cost-sensitive learning, and deep learning, etc. including their advantages and limitations. The efficiency and performance of the classifier are assessed using a myriad of evaluation metrics.

translated by 谷歌翻译

Large Language Models with Controllable Working Memory

Daliang Li , Ankit Singh Rawat , Manzil Zaheer , Xin Wang , Michal Lukasik , Andreas Veit , Felix Yu , Sanjiv Kumar

分类：自然语言处理 | 人工智能 | 机器学习

2022-11-09

Large language models (LLMs) have led to a series of breakthroughs in natural language processing (NLP), owing to their excellent understanding and generation abilities. Remarkably, what further sets these models apart is the massive amounts of world knowledge they internalize during pretraining. While many downstream applications provide the model with an informational context to aid its performance on the underlying task, how the model's world knowledge interacts with the factual information presented in the context remains under explored. As a desirable behavior, an LLM should give precedence to the context whenever it contains task-relevant information that conflicts with the model's memorized knowledge. This enables model predictions to be grounded in the context, which can then be used to update or correct specific model predictions without frequent retraining. By contrast, when the context is irrelevant to the task, the model should ignore it and fall back on its internal knowledge. In this paper, we undertake a first joint study of the aforementioned two properties, namely controllability and robustness, in the context of LLMs. We demonstrate that state-of-the-art T5 and PaLM (both pretrained and finetuned) could exhibit poor controllability and robustness, which do not scale with increasing model size. As a solution, we propose a novel method - Knowledge Aware FineTuning (KAFT) - to strengthen both controllability and robustness by incorporating counterfactual and irrelevant contexts to standard supervised datasets. Our comprehensive evaluation showcases the utility of KAFT across model architectures and sizes.

translated by 谷歌翻译

Intrusion Detection Systems Using Support Vector Machines on the KDDCUP'99 and NSL-KDD Datasets: A Comprehensive Survey

Mikel K. Ngueajio , Gloria Washington , Danda B. Rawat , Yolande Ngueabou

分类：人工智能 | 机器学习

2022-09-12

随着网络攻击和网络间谍活动的增长，如今需要更好，更强大的入侵检测系统（IDS）的需求更加有必要。 ID的基本任务是在检测Internet的攻击方面充当第一道防线。随着入侵者的入侵策略变得越来越复杂且难以检测，研究人员已经开始应用新颖的机器学习（ML）技术来有效地检测入侵者，从而保留互联网用户对整个互联网网络安全的信息和整体信任。在过去的十年中，基于ML和深度学习（DL）架构的侵入检测技术的爆炸激增，这些架构在各种基于网络安全的数据集上，例如DARPA，KDDCUP'99，NSL-KDD，CAIDA，CAIDA，CTU--- 13，UNSW-NB15。在这项研究中，我们回顾了当代文献，并提供了对不同类型的入侵检测技术的全面调查，该技术将支持向量机（SVMS）算法作为分类器。我们仅专注于在网络安全中对两个最广泛使用的数据集进行评估的研究，即KDDCUP'99和NSL-KDD数据集。我们提供了每种方法的摘要，确定了SVMS分类器的作用以及研究中涉及的所有其他算法。此外，我们以表格形式对每种方法进行了批判性综述，突出了所调查的每种方法的性能指标，优势和局限性。

translated by 谷歌翻译

Teacher Guided Training: An Efficient Framework for Knowledge Transfer

Manzil Zaheer , Ankit Singh Rawat , Seungyeon Kim , Chong You , Himanshu Jain , Andreas Veit , Rob Fergus , Sanjiv Kumar

分类：机器学习

2022-08-14

大型预估计模型（例如GPT-3）取得了显着的性能，在训练过程中暴露于大量数据上。类似地，将如此大型模型提炼成紧凑的模型以进行有效的部署，也需要大量（标记或未标记的）培训数据。在本文中，我们提出了培训高质量紧凑型模型的教师指导培训（TGT）框架，该模型利用了预验证的生成模型获得的知识，同时避免了大量数据的需求。 TGT利用了教师获得基础数据域的良好表示的事实，该事实通常对应于比输入空间要低得多的尺寸歧管。此外，我们可以使用老师通过采样或基于梯度的方法来更有效地探索输入空间。因此，使TGT对于有限的数据或长尾设置特别有吸引力。我们正式在我们的概括范围内正式捕获了所提出的数据域探索的好处。我们发现TGT可以提高几个图像分类基准以及一系列文本分类和检索任务的准确性。

translated by 谷歌翻译

Real-time Caller Intent Detection In Human-Human Customer Support Spoken Conversations

Mrinal Rawat , Victor Barres

分类：人工智能

2022-08-14

在人类人类客户支持语音互动期间的代理协助需要根据呼叫者的意图触发工作流程（通话原因）。预测的及时性对于良好的用户体验至关重要。目的是使系统在代理商能够检测到它时检测呼叫者的意图（意图边界）。一些方法着重于预测离线输出，即，一旦ASR系统处理了完整的口语输入（例如，整个对话转弯）。每当在转弯中早些时候可以检测到意图时，这会引入预测中的不良延迟。关于语音助手的最新工作已在单词层面上使用增量实时预测，以在命令结束之前检测意图。但是，人指导和机器指导的语音具有截然不同的特征。在这项工作中，我们建议将一种在语音辅助方面开发的方法应用于在线实时呼叫者在人类口语互动中的意图检测问题。我们使用双重体系结构，其中两个LSTM共同训练：一个预测意图边界（IB），然后预测IB处的意图类别。我们在私人数据集上进行实验，其中包括来自电信客户支持域的人类电话交谈的成绩单。我们报告结果分析了系统的准确性以及不同体系结构对整体准确性和预测潜伏期之间的权衡的影响。

translated by 谷歌翻译

Federated Learning for Medical Applications: A Taxonomy, Current Trends, and Research Challenges

Ashish Rauniyar , Desta Haileselassie Hagos , Debesh Jha , Jan Erik Håkegård , Ulas Bagci , Danda B. Rawat , Vladimir Vlassov

分类：机器学习 | 计算机视觉

2022-08-05

随着物联网，AI和ML/DL算法的出现，数据驱动的医疗应用已成为一种有前途的工具，用于从医学数据设计可靠且可扩展的诊断和预后模型。近年来，这引起了从学术界到工业的广泛关注。这无疑改善了医疗保健提供的质量。但是，由于这些基于AI的医疗应用程序在满足严格的安全性，隐私和服务标准（例如低延迟）方面的困难，因此仍然采用较差。此外，医疗数据通常是分散的和私人的，这使得在人群之间产生强大的结果具有挑战性。联邦学习（FL）的最新发展使得以分布式方式训练复杂的机器学习模型成为可能。因此，FL已成为一个积极的研究领域，尤其是以分散的方式处理网络边缘的医疗数据，以保护隐私和安全问题。为此，本次调查论文重点介绍了数据共享是重大负担的医疗应用中FL技术的当前和未来。它还审查并讨论了当前的研究趋势及其设计可靠和可扩展模型的结果。我们概述了FL将军的统计问题，设备挑战，安全性，隐私问题及其在医疗领域的潜力。此外，我们的研究还集中在医疗应用上，我们重点介绍了全球癌症的负担以及有效利用FL来开发计算机辅助诊断工具来解决这些诊断工具。我们希望这篇评论是一个检查站，以彻底的方式阐明现有的最新最新作品，并为该领域提供开放的问题和未来的研究指示。

translated by 谷歌翻译

SVGraph: Learning Semantic Graphs from Instructional Videos

Madeline C. Schiappa , Yogesh S. Rawat

分类：计算机视觉

2022-07-16

在这项工作中，我们专注于生成嘈杂的，教学视频的图形表示，以供视频理解。我们提出了一种自制，可解释的方法，该方法不需要任何图形表示的注释，这将是昂贵且耗时的。我们试图通过呈现语义视频图或SVGraph来克服“黑匣子”学习限制，这是一种多模式的方法，它利用叙述来实现学习图的语义解释性。SVGraph 1）依靠多种方式之间的一致性来学习统一的图形结构，并借助跨模式的注意力和2）在语义分配的帮助下分配语义解释，从而从视频叙述中捕获语义。我们在多个数据集上执行实验，并演示语义图学习中SVGraph的解释性。

translated by 谷歌翻译